MISO Purpose

► Track samples processing into libraries 

► Follow samples (DNA extracts) through library 
preparation and pooling for sequencing 

► Collect metadata for SRA submission 

► Monitor sequencer output and provide notifications 
and metrics 

► Allow use of different sequencing platforms, 
including Illumina, PacBio, 454, and SOLiD 

► Enable inter-team communication and high-level 
reporting 

► Provide a mature, open and extendable alternative for 
lab tracking 



ONTARIO INSTITUTE FOR CANCER RESEARCH 


Background 

MISO is an inventory tracking system for high-throughput sequencing (HTS) labs. 
It is currently developed in a highly coordinated cross-institute effort. 

OICR has an HTS lab 

► Managed through Geospiza GeneSifter LE LIMS 

► Geospiza was getting too expensive and feature requests 
weren’t being added 

► Geospiza is proprietary software, which inhibits 
improvements to workflows for lab users 

Earlham (formerly TGAC) has an HTS lab 

► Developed an in-house LIMS (previously presented) 

► Mostly small feature and maintenance work 

► OICR manager visited and...collaboration! 




Architecture 

[Diagram labels: OICR, Sequencer Output, Analysis Server, Earlham] 

Together? Forever? 

But It Works on My Machine 

► OICR and Earlham have similar but distinct needs 

► Earlham mostly handles DNA extracts; OICR 
handles tissue samples, slides, and extracts 

► Similar set of instruments, but OICR mainly uses 
the LIMS to track Illumina; Earlham tracks Illumina and 
PacBio 

► OICR has legacy data in Geospiza 

► OICR did not want to fork MISO entirely 

► We worked out a development plan to 
integrate OICR features into Earlham's mainline 

► Some are useful, some are irrelevant to Earlham 
Social Aspects 

MISO is now actively developed by Earlham and OICR 

► 1 developer at Earlham 

► 4 developers at OICR (until Geospiza sunset) 

► All active work visible on GitHub 

► Code reviews for everything with at least two 
developers 

► All features integrated into mainline (site-specific 
repositories are mostly configuration) 
New Developments 

New Features 

► Tissue processing workflow 

► Bulk input via Handsontable (Excel-like web 
interface) 



# | Library Alias | Sample Alias              | Description | Matrix Barcode | Platform | Type        | Selection        | Strategy | Barcode Kit         | Index 1           | Index 2         | Kit               | Volume | Conc.
1 |               | MGP_0003_Bn_P_nn_1-1_D_S1 |             |                | Illumina | Mate Pair   | RT-PCR           | AMPLICON | TruSeq Single Index | Index 01 - ATCACG |                 | TruSeq RNA Access | 0.0    | 0.00
2 |               | MGP_0002_Bn_R_nn_1-1_D_S1 |             |                | PacBio   | 2kb Shotgun | Hybrid Selection | WGS      | No barcode          |                   |                 | KAPA Hyper Prep   | 0.0    | 0.00
3 |               | MGP_0001_Bn_R_nn_1-1_D_S1 |             |                | Illumina | Single End  | DNAse            | CLONE    | Nextera Dual Index  | N701 - TAAGGCGA   | N501 - TAGATCGC | Nextera DNA       | 0.0    | 0.00
New Features 

► Storage boxes for handling frozen barcoded tubes 

► Scanner integration in the UI 

Box Use: DNA 

Box Size: 8 x 12 (can be scanned by your lab's bulk scanner) 

[Screenshot: box contents view with Selected Position, Name, Alias, and Barcode lookup fields] 

Position | Element Name | Element Alias
A02      | SAM32        | MGP_0001_Bn_R_nn_1-1_D_1
A03      | SAM33        | MGP_0003_Bn_R_nn_1-1_D_1
A04      | SAM14        | MGP_0001_Bn_P_nn_1-1_D_S1

Showing 1 to 3 of 3 entries 

[Box diagram: 8 x 12 grid of positions, rows A-H, columns 1-12] 
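
The position labels in the box view (A02, A03, A04) combine a row letter with a two-digit column number. As a minimal sketch of how such labels could map onto an 8 x 12 grid, assuming rows A-H and columns 1-12, the helpers below convert between a label and zero-based indices. The function names are hypothetical and are not taken from the MISO code base.

```python
import re

ROWS = 8      # rows A-H, matching the 8 x 12 box size on the slide
COLUMNS = 12  # columns 1-12


def parse_position(label: str, rows: int = ROWS, columns: int = COLUMNS) -> tuple[int, int]:
    """Convert a label such as 'A02' into zero-based (row, column) indices."""
    match = re.fullmatch(r"([A-Z])(\d{2})", label)
    if match is None:
        raise ValueError(f"malformed position label: {label!r}")
    row = ord(match.group(1)) - ord("A")
    column = int(match.group(2)) - 1
    # Reject labels that fall outside the physical box.
    if not (0 <= row < rows and 0 <= column < columns):
        raise ValueError(f"position {label!r} is outside a {rows} x {columns} box")
    return row, column


def format_position(row: int, column: int) -> str:
    """Convert zero-based indices back into a label such as 'A02'."""
    return f"{chr(ord('A') + row)}{column + 1:02d}"
```

With these assumptions, `parse_position("A02")` gives row 0, column 1, and a label such as `I01` is rejected because an 8-row box has no row I.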
New Features 

► Order management 

► Track what needs to be sequenced with what 
chemistry 


Orders 

Platform: Illumina - Illumina HiSeq 2500 
Sequencing Parameters: v3 2x101 
Partitions: [illegible] 

Platform: Illumina - Illumina Genome Analyzer 
Sequencing Parameters: 1x151 
[Add] 

Order Completion 

Platform                       | Parameters | Failed | Requested   | Remaining
Illumina - Illumina HiSeq 2500 | v4 2x126   | 2      | [illegible] | 0
Illumina - Illumina HiSeq 2500 | v3 2x101   | 0      | 1           | 1

Unfulfilled Pool Orders 

Pool Name | Alias       | Platform                       | Sequencing Parameters | Requested | Remaining
IPO5      | TEST DOl 4  | Illumina - Illumina HiSeq 2500 | Rapid Run 2x51        | 2         | 2
IPO3      | TEST Dool 1 | Illumina - Illumina HiSeq 2500 | v3 2x101              | 1         | 1
IPO1      | asdfasd     | Illumina - Illumina HiSeq 2500 | v4 2x126              | 3         | 3

Showing 1 to 3 of 3 entries 
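
The order views track requested, failed, and remaining partitions per platform and sequencing parameters. The sketch below shows one way that bookkeeping could work. It assumes that only successfully sequenced partitions count toward an order, so failed partitions leave the remaining count unchanged. The class and field names are hypothetical, and the slides do not confirm this rule.

```python
from dataclasses import dataclass


@dataclass
class PoolOrder:
    pool_name: str
    platform: str
    parameters: str
    requested: int       # partitions requested for this pool
    completed: int = 0   # hypothetical: partitions sequenced successfully
    failed: int = 0      # hypothetical: partitions that failed

    @property
    def remaining(self) -> int:
        # Assumption: failed partitions do not reduce what is still owed.
        return max(self.requested - self.completed, 0)


def unfulfilled(orders: list[PoolOrder]) -> list[PoolOrder]:
    """Return the orders that still have partitions left to sequence."""
    return [order for order in orders if order.remaining > 0]
```

Under this assumption an order for three partitions with one success and two failures still has two remaining, so it stays in the unfulfilled list.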
New Features 


► Front-end input validation 


oh snap! 

This form seems to be invalid 


► Change logs 


Library Information 

Library ID: 
Location: 
Name: LIB5 
Parent Sample: MGP_0001_Bn_R_nn_1-1_D_1 (SAM32) 
Alias:* MGP_0001_bn_R_PE_301_WG 
[Validation pattern, truncated in the screenshot: ...[A-Z]{1}[a-z]{1})_([nRPXMCFETO])_(SE|PE|MP|\?...] 
Description:* test 
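
The form above rejects an alias that does not match a naming pattern. The pattern in the screenshot is truncated, so the sketch below uses a simplified stand-in to illustrate the idea. `ALIAS_PATTERN` and `is_valid_alias` are hypothetical and are not the rule MISO actually applies.

```python
import re

# Hypothetical, simplified alias pattern (the real one is truncated on the slide):
# project _ 4-digit number _ tissue origin (e.g. "Bn") _ tissue type letter _
# run type (SE, PE, or MP) _ insert size _ two-letter library design code.
ALIAS_PATTERN = re.compile(
    r"[A-Z]+_\d{4}_[A-Z][a-z]_[nRPXMCFETO]_(SE|PE|MP)_\d+_[A-Z]{2}"
)


def is_valid_alias(alias: str) -> bool:
    """Return True when the whole alias matches the assumed naming pattern."""
    return ALIAS_PATTERN.fullmatch(alias) is not None
```

Under this assumed pattern, `MGP_0001_Bn_R_PE_301_WG` passes, while the alias from the screenshot, `MGP_0001_bn_R_PE_301_WG`, fails because the tissue origin starts with a lowercase letter.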


Changes 

Summary                                             | Time
admin has changed: ready: 1 → 0                     | 2016-06-17 22:08:08.0
admin has changed: QC passed: n/a → 1, ready: 0 → 1 | 2016-06-07 14:09:07.0
admin has changed: ready: 1 → 0                     | 2016-04-25 19:24:38.0
admin created pool.                                 | 2016-04-25 19:19:40.0
admin Added: LDI1                                   | 2016-04-25 19:19:40.0
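
Each change-log summary names the user and lists the fields whose values changed. As a rough sketch of how such a summary could be built from two snapshots of a record, the function below compares field values and formats the differences. The function name, the use of plain dictionaries, and the arrow separator are assumptions, not MISO's implementation.

```python
from typing import Optional


def summarize_changes(user: str, before: dict, after: dict) -> Optional[str]:
    """Build a summary such as 'admin has changed: ready: 1 → 0'.

    Returns None when no field differs between the two snapshots.
    """
    parts = []
    for field, new_value in after.items():
        # A field missing from the earlier snapshot is reported as "n/a".
        old_value = before.get(field, "n/a")
        if old_value != new_value:
            parts.append(f"{field}: {old_value} → {new_value}")
    if not parts:
        return None
    return f"{user} has changed: " + ", ".join(parts)
```

A caller would store the returned summary with a timestamp whenever a record is saved, and skip the entry when the function returns None.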
New Procedures 

Shared development 

► Everything gets code reviewed 

► Improved testing and plans for more 

More plug-and-play 

► Externalization of configuration 

► Weekly releases at OICR 

► Fewer Earlham-specific assumptions 
Future Plans 

Tissue processing workflows 

► Currently OICR-specific; Earlham intends to use them, so 
they will be generalised 

► OICR has variable workflow, so this will expand 
anyway 

Better UI 

► Excel-like Handsontable, used in new 
developments, will be applied to existing data 
import 

► Customisable reporting 
Availability 

► Source available on GitHub 

► Docker container available for low-effort testing 

► On my laptop! Find me and request a demo 

https://github.com/TGAC/miso-lims 

http://miso-lims.slack.com 

New institutes welcome. 

We're developing a community. 



Team 



Rob Davey (Earlham) 



Chris Salt (Earlham) 



Heather Armstrong (OICR) 



Dillan Cooke (OICR) 
Tony DeBat (OICR) 



Andre Masella (OICR) 



Morgan Taschuk (OICR) 
Earlham Institute 

OICR 
Ontario Institute for Cancer Research 


http://earlham.ac.uk/davey-group 


http://bit.do/oicrgsi 